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It is shown that different distinguishability measures impose different orderings on ensembles of N 
pure quantum states. This is demonstrated using ensembles of equally-probable, linearly indepen- 
dent, symmetrical pure states, with the maximum probabilities of correct hypothesis testing and 
unambiguous state discrimination being the distinguishability measures. This finding implies that 
there is no absolute scale for comparing the distinguishability of any two ensembles of N quantum 
states, and that distinguishability comparison is necessarily relative to a particular discrimination 
strategy. 
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I. INTRODUCTION 

The use of quantum channels to send classical informa- 
tion has many advantages over the use of classical chan- 
nels. One of these is the fact that it enables one to estab- 
lish cryptographic keys in a way that is provably secure, 
a feat which has never been achieved classically. Devel- 
opments such as this have led to renewed attention being 
given to the problem of distinguishing between quantum 
states |Q. In state discrimination, we are faced with the 
following situation: a quantum system is prepared in one 
of N states. For the sake of simplicity, we will take these 
to be pure states where j = 1, . . ., N. The a priori 
probability of the state of the system being is pj . We 
would like to determine which state has been prepared. 
Unless the states are orthogonal we cannot determine 
the state perfectly. We are then faced with the prob- 
lem of devising a strategy which discriminates between 
the N potential states as well as possible. This will in- 
volve some, possibly generalised, quantum measurement. 
This measurement should be optimised and the result- 
ing figure of merit, which is typically a probability, can 
be regarded as a measure of the distinguishability of the 
states with these a priori probabilities. A set of quan- 
tum states \tpj) considered together with their a priori 
probabilities pj forms an ensemble £ = £(j)j,\ipj)). So, 
distinguishability measures will refer to ensembles. 

We shall denote a generic distinguishability measure by 
-D[£]. Several distinct measures are in common use for 
quantifying the distinguishability of states Q. The ques- 
tion we address in this paper is: do all distinguishability 
measures impose the same ordering on ensembles of N 
quantum states? That is, suppose that we have two en- 
sembles, £\ and £ 2 , and two distinguishability measures, 
D\ and Z?2- Then, if £q[£i] > D\\£^\ % is it the case that 
P > 2[£i]>D 2 [£2]^ We shall see that this is not necessarily 
so. 

The distinguishability measures we choose are the 
maximum probabilities of correct hypothesis testing and 
unambiguous state discrimination. For the sake of sim- 
plicity, we restrict our attention to ensembles of states 



which have equal a priori probabilities. We show that 
for two equally-probable pure states, this effect cannot 
be demonstrated for these distinguishability measures, 
which leads us to consider ensembles of N > 2 states. 
We restrict our attention to linearly independent ensem- 
bles, since this is a requirement for unambiguous state 
discrimination The states in the chosen ensembles 
are taken to form symmetrical sets. This is done to take 
advantage of the fact that for such ensembles, our distin- 
guishability measures can be calculated analytically for 
equal a priori probabilities. The effect we have described 
is shown to occur for N = 3. We conclude with a discus- 
sion of the connection between this effect and a related 
one recently discovered by Jozsa and Schlicnz|p[. 



II. DISTINGUISHABILITY MEASURES 
IMPOSING DIFFERENT ENSEMBLE 
ORDERINGS 

The distinguishability measures we choose are the 
maximum probabilities of correct hypothesis testing and 
unambiguous state discrimination. In hypothesis testing 
among N states in the ensemble £(j)j, we consider 

an N outcome generalised measurement, in which the 
jth outcome is associated with the POVM element Ej. 
Our hypothesis is that the outcome of the measurement 
corresponds exactly to the state. If the state is \ifij) and 
outcome j is obtained, then the hypothesis is correct. If 
however, outcome j'^j is obtained, then the hypothesis 
will be incorrect. The maximum probability Phyp that 
our hypothesis is correct is[0 



P 



HYP 



(2.1) 



where the maximisation is carried out over all sets of N 
positive operators Ej such that • E 3 ■ — 1. We shall use 
the maximum probability of correct hypothesis testing, 
Phyp{£), as a measure of the distinguishability of the 
ensemble £. 
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In unambiguous state discrimination, there are only 
two possible outcomes for the state \tpj}'- outcome j and 
a further inconclusive result '?'. There are no errors. 
Unlike hypothesis testing, unambiguous discrimination 
is only possible for linearly independent setsQ and it 
is to such sets that we will restrict our attention. If 
PuSD(\ipj)) is the probability that, given the initial state 
was \tpj) , we obtain a conclusive identification rather than 
an inconclusive result, then the maximum probability of 
unambiguous state discrimination is 

Pusd(£) = max VftPE/Sfld^)), (2.2) 

{PuSD(\lf'j))} ~ 

where the extremisation with respect to the Pusod^j)) 
is discussed by Duan and GuoQ. The distinguishability 
measure we will use for this strategy will be the maximum 
probability Pusd{£) of unambiguous determination of 
the state. 

Here, we will show that there exist ensemble pairs for 
which 

Pusd{£2) < Pusd(£i), (2.3) 
Phyp{S 2 ) > P HY p{£i). (2.4) 

The effect we wish to demonstrate does not occur for an 
ensemble of just two equally-probable pure states. This 
can be seen by examining the values of Phyp and Pusd 
for a pair of pure states, \ipi) and \ip2)- For later conve- 
nience, we will give expressions for these with arbitrary 
a priori probabilities p\ and p2- The former is given by 
Helstroni's boundQ: 

Phyp = \(^+ V 7 ! - (1 - A 2 )|(^ 2 )| 2 ) , (2.5) 

where A = \p\ — p%\. Also, the maximum value of Pusd 
for a pair of pure states is given by the Jaeger-Shimony 
bound § 

(2.6) 

When the a priori p robabilities are equal, A = and 
Phyp = (1/2) (1 + y/1 - (1 - (1 - Pusd) 2 )- From this, 
one can show that Phyp is an increasing function of 
Pusd, implying that inequalities (2.3) and (2.4) can 
never be simultaneously satisfied. 

To find ensembles of equally-probable pure states for 
which both (2.3) and (2.4) are true, we have to consider 
at least three states. We focus on ensembles of equally- 
probable, linearly independent, symmetrical states, since 
the maximum probabilities for unambiguous discrimina- 
tion and correct hypothesis testing can be calculated ex- 
plicitly for these. We will demonstrate the existence of 
ensemble pairs £\ and £2 which satisfy inequalities (2.3) 
and (2.4), in the following way. Firstly, we will consider 



all sets of N linearly independent, symmetric states with 
equal a priori probabilities which have the same, arbi- 
trary but fixed, value of Pjjsd- Over this set, we will find 
the extremal values of Phyp- Using this information, 
we choose ensembles pairs which satisfy inequality (2.3), 
but where £ 1 and £2 have respectively the minimum and 
maximum values of Phy p for their corresponding values 
of Pusd- We will find that, for N = 3, inequality (2.4) 
is satisfied for a large range of parameters. 

The N pure states \ipj), where j = 0, . . ., N — 1, are 
linearly independent and symmetric if and only if they 
can be written as 

N-l 

= X) c re^M, (2.7) 

r=0 

for some N orthonormal states \x r ) and non-zero complex 
coefficients c r satisfying X)rl c >~| 2 = F Notice that the 
phase of c r may be absorbed by \x r ), which implies that 
we may, without loss of generality, take c r to be real and 
positive, which we shall. We will take all states to have 
equal a priori probabilities pj — 1/N. The maximum 
unambiguous discrimination probability for these states 
is@ 

Pusd = Nxmmc 2 r . (2.8) 

r 

The optimum hypothesis testing strategy uses the so- 
called 'square-root' measurement Q . Define the operator 

j 

and the states 

| Wi >=*-Va|^.). (2.10) 

One can quite easily show that the operators E r — 
\uj r ) (u r \ form a POVM (i.e., that E r >0 and £ r E r = 1.) 
This POVM is the optimal hypothesis testing strategy, 
and we find that 




(2-11) 



The square-root measurement optimally discriminates 
between any set of equally-probable, symmetric states, 
even if they are not linearly independent. This measure- 
ment has recently been carried out[^| ^| for symmetrical 
optical polarisation states. Applications of this measure- 
ment to quantum key distribution are discussed injl0[. 

We now calculate the global extrema of Phyp for a 
fixed value of Pusd- Our aim is to fix the smallest of 
the Cr, which is equivalent to fixing Pjjsd, and vary the 
remaining coefficients to obtain the extremal values of 
Phyp- We may let min r c r = cq. This allows us to write 

c r = c + cos 2 6> r , (2.12) 
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for r = 1 , . . . , N — 1 and some angles Oj . We will 
now extremise Phyp with respect to these angles us- 
ing the method of Lagrange multipliers, in order to 
take into account the normalisation constraint. This 
method will yield the local extrema, over which we will 
subsequently optimise to find the global extrema. Let 
G = (X^^o 1 c r) ~ 1- The constrained, local extrema of 
Phyp occur where 



OPhyp 



dG 



(2.13) 



and A is our Lagrange multiplier. Inserting Eq. (2.12) 
into the definitions of Phyp and G, we find that this 
becomes 



sin^cos^r 



Co 



1 

N 



N-l 

E 

r'=l 



COS 



■e r > - a(c + 



cos 



= 0. 



(2.14) 

It is a simple matter to show that the normalisation re- 
quirement will be violated if sin# r = for any r. So, for 
each r, either cos6* r = 0, in which case c r = cq, or the 
term in brackets is zero. To proceed, we will partition 
the c r into two sets, where each set corresponds to one 
of these two possibilities. Let there be No coefficients, 
including cq, which are equal to cq. For the remaining 
N — N coefficients, the term in brackets is zero. All 
of the coefficients in this latter set must also be equal, 
as can be seen from the fact that the vanishing of the 
bracket term implies that the corresponding cos 2 # r are 
equal. The value that they have is easily deduced from 
the fact that the other No coefficients are equal to Co, 
and normalisation. We find they have the value 



1-N c 2 
N -No ' 



(2.15) 



We can now calculate the local extrema of Phy p ■ Eval- 
uating Eq. (2.11) where No coefficients are equal to c , 
with the remaining N—No coefficients being given by Eq. 
(2.15), and making use of the fact that c = \/PusdJN, 
we find that the local extrema of Phyp for fixed Pusd 



Phyp = ^ ( n ^ p usd + \W- 



N )(N- 



NoPusd) 
(2.16) 

Finding the global extrema of Phy p amounts to extremi- 
sation of this expression with respect to No- One method 
of doing this is to treat No as a continuous parameter 
in the interval [l,N — 1] and differentiate Eq. (2.16) 
with respect to it. After some algebra, we find that 
dP H Yp/dN o <0, for P HY p given by Eq. (2.16). This im- 
plies that the global maximum and minimum of Phyp 
occur at the minimum and maximum values of No re- 
spectively. Thus, the maximum value of Phyp occurs at 
No = 1, which gives us the tight upper bound 



Phyp<^ (VPusd + V(N-l)(N-Pu SD ) 



HYP 
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FIG. 1: Contour plot of the ratio of the maximum prob- 
abilities of correct hypothesis testing for ensembles £1 and 
£2 of three linearly independent, equally- probable, symmet- 
ric states, versus the maximum unambiguous discrimina- 
tion probability Pusd(£i) of £\ and the parameter e = 
Pusd{£i) — Pusd{£2}- This parameter is positive which 
implies that, except at the origin, inequality (2.3) is sat- 
isfied. Inequality (2.4) is satisfied whenever the ratio 
Phy p (£2)/ Phyp (£1) > 1, which, on this plot, corresponds 
to points lying above the fourth contour from the bottom. 



For Pusd = 1, we see that the maximum of Phyp is also 
equal to 1. However, when Pusd = 0, the maximum of 
Phyp is (N - 1)/N. 

We also see that the minimum value of Phyp occurs 
at A^o = N — 1, which gives us the tight lower bound 



(2.17) 



Phyp>j^ ((N - l)^Pu^+ y/N-(N-l)Pu SD ~ 

(2T8) 

When Pusd = 1, the minimum of Phyp is also 1, as we 
would expect. However, if Pusd = 0, then the minimum 
of Phyp is 1/N, which corresponds to a random guess 
of which of the N possible states the system has been 
prepared in. In fact, it is quite easily shown that in this 
case, all \ipj) are, up to a phase, equal to \x ). Here, 
the states are linearly dependent and this case is of no 
interest to us in the present context. 

Let us now use the bounds in (2.17) and (2.18) to 
demonstrate the existence of ensembles for which inequal- 
ities (2.3) and (2.4) hold. For the sake of simplicity, 
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we will consider only ensembles for which N = 3. We 
will choose £ 2 to be an ensemble which saturates the 
bound in (2.17) and £\ to be one which saturates the 
bound in (2.18), yet where PusD^i) = Pusd{£\) - e, 
for some positive, non-zero parameter e. This guar- 
antees that inequality (2.3) is satisfied. Figure 1 de- 
picts Phyp{£i)/ Phyp{£i) as a function of Pusd{£\) 
and e, where 0<e<PusD(£i) and N = 3. Satisfac- 
tion of the inequality (2.4) occurs for parameters where 
Pry p{£2) / Phy p{£\) > 1, and a large range of such pa- 
rameters in clearly visible in the figure. 

For example, let Pusd{£\) = 0.5 and Pusd{£2) = 
0.4. This gives e = 0.1 and inequality (2.3) is satis- 
fied. Evaluating the maximum correct hypothesis testing 
probabilities, we find that Phyp(£i) — 8/9^0.888 and 
Phyp{£2)~ 0.943 > Phyp{£i) which satisfies inequality 
(2.4). 



III. DISTINGUISHABILITY AND 
INFORMATION 

We have seen that different distinguishability measures 
impose different orderings on ensembles of pure quantum 
states. We considered linearly independent states and 
demonstrated this effect using the maximum probabili- 
ties of correct hypothesis testing and unambiguous state 
discrimination as distiguishability measures. 

This result has important implications for any situa- 
tion where we wish to transmit nonorthogonal states to 
send classical information. For example, in quantum key 
distribution, we may have a choice of sending states pre- 
pared in either ensemble £\ or ensemble £2- The distin- 
guishability of these ensembles may determine the poten- 
tial information available to an eavesdropper and also the 
rate at which key bits can be generated. It would there- 
fore be desirable to able to compare the distinguishabil- 
ity of both ensembles. What the results in the preceding 
show is that this cannot be done in any absolute sense; 
we must know in advance which distinguishability mea- 
sure is being used. However, we may not know which 
measure would apply to a potential eavesdropper since 
we may not know in advance which detection strategy 
they would employ. 

To put the results of the preceding section in context, 
it is helpful to consider a related finding recently made 
by Jozsa and SchlienzQ. They considered two ensembles 
of pure states, £\ and £27 with all states in each ensem- 
ble having equal a priori probabilities. They used the 
pairwise overlaps of pairs of states as a means of quanti- 
fying the distinguishability of each ensemble, and the von 
Neumann entropy Si of the density operator of ensemble 
£i as a means of quantifying information content. They 
showed that there exist ensemble pairs in which the pair- 
wise overlaps, and hence the distiguishability of £ 2 are 
less than those of £\ , yet where S2 > Sx, implying that 
the information content of £2 is greater than that of £ 1 . 
This is somewhat counterintuitive, as information and 



distinguishability are often regarded as interchangeable 
concepts. 

This finding suggests that there are distinctions to be 
made between information and distinguishability. That 
they are not entirely interchangeable is perhaps suggested 
by the fact that they arise in different contexts. In at- 
tempting to distinguish between a set of states, we are 
given only one copy, and must make optimum use of it. 
In information transmission however, we perform a col- 
lective measurement on a large number of states drawn 
from the same ensemble. The length of the strings and 
the subset chosen from the set of all possible strings is 
such that these strings are highly distinguishable. In- 
deed, they are chosen in such a way that the probability 
of failing to distinguish between the states perfectly can 
be made arbitrarily small. The information content is the 
number of bits of classical information that can be trans- 
mitted per signal in each string, where the signals obey 
the statistical constraints of the ensemble. In information 
transmission, we select strings which are sufficiently long 
so that we can neglect the distinguishability issue in one 
way. However, it is possible that this issue does return in 
some form to dictate the information content of a single 
signal in ways that are not currently well-understood. 

A conclusive distinction, which docs not apply to sit- 
uations in which all states have the same a priori proba- 
bilities, is that information content and distinguishability 
can vary in opposite ways when the a priori probabili- 
ties of the states are altered. To illustrate this, consider 
two ensembles, £\ and £2, where both consist of the pure 
states l^i) and IV^)- In £1, both states have equal a pri- 
ori probabilities equal to 1/2. However, in £2, they have 
unequal a priori probabilities p\ and P2 ■ In state discrim- 
ination, our goal is to determine the state. If one state is 
more probable than the other, then we can take advan- 
tage of this fact to tailor our measurement to weight it in 
favour of the state which has higher a priori probability, 
so that, on average, we will be able to improve our ability 
to determine the state. It follows that £2 ought to have 
higher distinguishability than £±. This is made explicit 
if we look back at either the Helstrom bound (2.5) or 
the Jaeger-Shimony bound (2.6). These both increase as 
A = \pi — P2I increases, confirming this expectation. 

However, if we wish to use states in ensemble £2 to send 
messages, then the constraint that one state must always 
be more probable than the other implies that, to regain 
our freedom in the message we might choose to send, we 
must send more signal states per message as A increases. 
Quantitatively, for £2, the information content is given 
by the von Neumann entropy of the ensemble density 
operator, which is equal to the binary entropy function 

52 = — xlog 2 x — (1 — x)log 2 (l — x) (3.1) 

where 

x = 1 (l + v /l-(l-A 2 )(l-|(^i|^)| 2 )) . (3.2) 
As A increases, x also increases which implies that, as 
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can easily be shown, 52 decreases. The extreme situa- 
tion is where one of the a priori probabilities, say pi, is 
equal to 1, in which case pi = 0. Here, we know in ad- 
vance what the state is, and so we can always determine 
it perfectly. We may say that this ensemble has perfect 
distinguishability. However, the sender has no freedom 
in which state to send and the state is always entirely 
predictable. For this reason, it is impossible to trans- 
mit any information using this ensemble, for which 52 is 
easily seen to be zero. 

As the above argument demonstrates, a clear distinc- 
tion between information content and distinguishability 
emerges if variation of the a priori probabilities of the 
states is permitted. However, it is not entirely clear why 
they should be distinct concepts when the a priori prob- 
abilities of all states are equal. Consequently, for en- 
sembles with equal a priori probabilities, the observation 
that information and distinguishability impose different 
orderings remains to be properly understood. 

The results in this paper suggest a potential explana- 
tion of this phenomenon. Given that it is not clear why 
information content should not be regarded as a distin- 
guishability measure when we restrict our attention to 
ensembles of states with equal a priori probabilities, let 
us assume that it is in fact a suitable distinguishability 
measure under these circumstances. The Jozsa-Schlienz 
effect could then be explained as a demonstration of the 
fact that, even for ensembles of equally- probable states, 
different distinguishability measures impose different en- 
semble orderings, which is what we have shown in this 
paper, using quantities which are indisputably suitable 
distinguishability measures. 

It remains to be understood why different distinguisha- 
bility measures impose different orderings. Perhaps, for 
two distinguishability measures to be non-trivially dis- 
tinct from one another, they must be sensitive to different 
aspects of ensembles of states, which can vary, with some 
degree of independence, from one ensemble to another. 
This seems reminiscent of the discovery recently made by 
Virmani and Pleniojll| that different, good measures of 
the entanglement of mixed states must order these states 
differently. 



IV. DISCUSSION 

We have shown in this paper that two of the most 
common measures of the distinguishability of states, the 
maximum probabilities of correct hypothesis testing and 
unambiguous state discrimination, are essentially incom- 
patible with each other. By this, we mean that they 
do not impose the same ordering on ensembles of pure 
states. It is possible to have one ensemble, £\, which is 
more distinguishable than another ensemble £2 when the 
states must be distinguished unambiguously, yet where 
£2 is more distinguishable than £ 1 if we wish to identify 
the state by minimum error hypothesis testing. In gen- 
eral, we cannot, in any absolute sense, characterise one 



ensemble of states as being more distinguishable than 
another. Distinguishability comparison must necessarily 
refer to a particular discrimination strategy. 

As with any interesting phenomenon, it is important 
to determine the conditions under which this effect can 
be demonstrated optimally. Here, we are faced with the 
fact that, for generic ensembles of states, the optimisa- 
tion problems which must be solved to obtain values of 
distinguishability measures are difficult to tackle analyt- 
ically. At the time of writing, the only ensembles for 
TV > 2 for which both of our chosen distinguishability 
measures can be calculated analytically are ensembles of 
equally-probable, linearly independent, symmetric pure 
states. For such ensembles, inequalities (2.16) and (2.17) 
give the extremal values of the correct hypothesis testing 
probability for an fixed value of the maximum probability 
of unambiguous state discrimination, and so the optimal 
conditions for demonstrating this effect for these ensem- 
bles are given by these relations. We also explored in 
some detail the case of N = 3. 

The effect we have demonstrated is somewhat reminis- 
cent of a related one recently discovered by Jozsa and 
Schlienz. These authors showed that one can construct 
an ensemble pair, £\,£2, where the distinguishability of 
£1, as measured by the pairwise overlaps of states, is 
greater than that of £2, yet where the information content 
of £2 , which is quantified by the von Neumann entropy, 
is greater than that of £\ . 

This is an important finding, not least because the con- 
cepts of distinguishability and information content are 
sometimes used interchangeably. We described how es- 
sential differences between these concepts do arise when, 
in moving from one ensemble to another, the a priori 
probabilities of the states are changed. However, if the a 
priori probabilities of all states in both ensembles are the 
same, it is by no means clear that, despite the contrast 
between the asymptotic nature of information and the 
'one shot' nature of distinguishability, information mea- 
sures cannot also serve as measures of distinguishability. 

Jozsa and Schlienz demonstrated their effect using 
such ensembles. What the results in this paper suggest 
is a possible interpretation of the Jozsa-Schlienz effect. 
They showed that, for ensembles of equally- probable 
pure states, information and a particular distinguishabil- 
ity measure suffer an ordering incompatibility problem. 
Does this imply that for such ensembles, information con- 
tent is not a suitable distinguishability measure? The 
results in this paper suggest that this is not necessarily 
the case, since different distinguishability measures suffer 
analogous ordering problems. It could then be the case 
that, for the ensembles considered by Jozsa and Schlienz, 
information content is in fact a suitable distinguishability 
measure and that the ordering incompatibility they dis- 
covered is a consequence of the fact that such problems 
arise with regard to generic distinguishbility measures. 
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